Speaker Independent Phonetic Recognition Using Auditory Modelling and Recurrent Neural Networks

نویسنده

  • P. Cosi
چکیده

Two speaker independent speech recognition experiments, regarding the automatic discrimination of the Italian alphabet I-set and E-set, two very difficult Italian phonetic classes, will be described. The speech signal is analyzed by a recently developed joint synchrony/mean-rate auditory processing scheme and a fully-connected feed-forward recurrent BP network was used for the classification stage. The achieved speaker independent mean recognition rate was 65%, for the Iset and 88% for the E-set showing rather satisfactory results given the difficulty of both tasks.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bimodal recognition experiments with recurrent neural networks

A bimodal automatic speech recognition system, using simultaneously auditory model and articulatory parameters, is described. Results given for various speaker dependent phonetic recognition experiments, regarding the Italian plosive class, show the usefulness of this approach especially in noisy conditions.

متن کامل

Phonetic recognition by recurrent neural networks working on audio and visual information

A phonetic classification scheme based on a feed forward recurrent back-propagation neural network working on audio and visual information is described. The speech signal is processed by an auditory model producing spectral-like parameters, while the visual signal is processed by a specialised hardware, called ELITE, computing lip and jaw kinematics parameters. Some results will be given for va...

متن کامل

Speaker independent bimodal phonetic recognition experiments

A speaker independent bimodal phonetic classification experiment regarding the Italian plosive consonants is described. The phonetic classification scheme is based on a feed forward recurrent back-propagation neural network working on audio and visual information. The speech signal is processed by an auditory model producing spectral-like parameters, while the visual signal is processed by a sp...

متن کامل

Acoustic-phonetic decoding based on elman predictive neural networks

In this paper we present a phoneme recognition system based on the Elman predictive neural networks. The recurrent neural networks are used to predict the observation vectors of speech frames. Recognition of phonemes is done using the prediction error as distortion measure in the Viterbi algorithm. The performance of the neural predictive networks is evaluated on both the training database and ...

متن کامل

Isolated Voiced Digit Recognition Using Inductive Inference

This paper proposes the use of inductive inference "decision trees" for isolated digit recognition. The aim of this research is to demonstrate that inductive learning can provide an alternative approach to existing automatic speech recognition techniques such as Dynamic Time Warping (DP), Hidden Markov Modelling (HMM) and Neural Networks (NN). The construction of the decision tree is based on C...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994